Clever - Search
  1. namely CLEVER, which is augmentation-free and mitigates biases on the inference stage. Specifically, we train a claim-evidence fusion model and a claim-only model …

  2. Measuring Mathematical Problem Solving With the MATH Dataset

    Oct 18, 2021 · To find the limits of Transformers, we collected 12,500 math problems. While a three-time IMO gold medalist got 90%, GPT-3 models got ~5%, with accuracy increasing slowly.

  3. Weakly-Supervised Affordance Grounding Guided by Part-Level...

    Jan 22, 2025 · In this work, we focus on the task of weakly supervised affordance grounding, where a model is trained to identify affordance regions on objects using human-object …

  4. Alias-Free Mamba Neural Operator - OpenReview

    Functionally, MambaNO achieves a clever balance between global integration, facilitated by state space model of Mamba that scans the entire function, and local integration, engaged with an …

  5. Reasoning of Large Language Models over Knowledge Graphs …

    Jan 22, 2025 · While large language models (LLMs) have made significant progress in processing and reasoning over knowledge graphs, current methods suffer from a high non-retrieval rate.

  6. Not All Tokens Are What You Need for Pretraining | OpenReview

    Sep 25, 2024 · Previous language model pre-training methods have uniformly applied a next-token prediction loss to all training tokens. Challenging this norm, we posit that ''Not all tokens …

  7. LLaVA-OneVision: Easy Visual Task Transfer - OpenReview

    Feb 10, 2025 · We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations in the …

  8. MIND over Body: Adaptive Thinking using Dynamic Computation

    Jan 22, 2025 · Keywords: Interpretability, Fixed points, Dynamic routing, Dynamic input processing, Deep Learning Framework

  9. Ignore Previous Prompt: Attack Techniques For Language Models

    Fábio Perez, Ian Ribeiro · AE Studio · {fperez,ian.ribeiro}@ae.studio

  10. Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive ...

    Jan 22, 2025 · Peptides, short chains of amino acids, interact with target proteins, making them a unique class of protein-based therapeutics for treating human diseases. Recently, deep …